BiDaS: a web-based Monte Carlo BioData Simulator based on sequence/feature characteristics
نویسندگان
چکیده
BiDaS is a web-application that can generate massive Monte Carlo simulated sequence or numerical feature data sets (e.g. dinucleotide content, composition, transition, distribution properties) based on small user-provided data sets. BiDaS server enables users to analyze their data and generate large amounts of: (i) Simulated DNA/RNA and aminoacid (AA) sequences following practically identical sequence and/or extracted feature distributions with the original data. (ii) Simulated numerical features, presenting identical distributions, while preserving the exact 2D or 3D between-feature correlations observed in the original data sets. The server can project the provided sequences to multidimensional feature spaces based on: (i) 38 DNA/RNA features describing conformational and physicochemical nucleotide sequence features from the B-DNA-VIDEO database, (ii) 122 DNA/RNA features based on conformational and thermodynamic dinucleotide properties from the DiProDB database and (iii) Pseudo-aminoacid composition of the initial sequences. To the best of our knowledge, this is the first available web-server that allows users to generate vast numbers of biological data sets with realistic characteristics, while keeping between-feature associations. These data sets can be used for a wide variety of current biological problems, such as the in-depth study of gene, transcript, peptide and protein groups/families; the creation of large data sets from just a few available members and the strengthening of machine learning classifiers. All simulations use advanced Monte Carlo sampling techniques. The BiDaS web-application is available at http://bioserver-3.bioacademy.gr/Bioserver/BiDaS/.
منابع مشابه
Op-nare130420 582..586
BiDaS is a web-application that can generate massive Monte Carlo simulated sequence or numerical feature data sets (e.g. dinucleotide content, composition, transition, distribution properties) based on small user-provided data sets. BiDaS server enables users to analyze their data and generate large amounts of: (i) Simulated DNA/RNA and aminoacid (AA) sequences following practically identical s...
متن کاملA Monte Carlo-Based Search Strategy for Dimensionality Reduction in Performance Tuning Parameters
Redundant and irrelevant features in high dimensional data increase the complexity in underlying mathematical models. It is necessary to conduct pre-processing steps that search for the most relevant features in order to reduce the dimensionality of the data. This study made use of a meta-heuristic search approach which uses lightweight random simulations to balance between the exploitation of ...
متن کاملCharacteristics of lead glass for radiation protection purposes: A Monte Carlo study
Background: Lead glass has a wide variety of applications in radiation protection. This study aims to investigate some characteristics of lead glass such as the γ-ray energy-dependent mass and linear attenuation coefficients, the half-value layer thickness, and the absorbed dose distribution for specific energy. Materials and Methods: The attenuation parameters of different lead glass types aga...
متن کاملDetermination of Dosimetric characteristics of a New 192Ir-PDR Brachytherapy Source According to AAPM TG- 43 Protocol using Monte Carlo simulation technique
Introduction: 192Ir is one of the important sources frequently used in brachytherapy. Up to now, a lot of commercial models of this source have been made which Ir-192 has been recently added to them. The aim of the present study is to determine the dosimetric parameters of this new source model based on the recommendations of TG-43(U1) protocol using Monte Carlo simulation tech...
متن کاملA comparative Monte Carlo study on 6MV photon beam characteristics of Varian 21EX and Elekta SL-25 linacs
Background: Monte Carlo method (MC) has played an important role in design and optimization of medical linacs head and beam modeling. The purpose of this study was to compare photon beam features of two commercial linacs, Varian 21EX and Elekta SL-25 using MCNP4C MC code. Materials and Methods: The 6MV photon beams of Varian 21EX and Elekta Sl-25 linacs were simulated based on manufacturers pro...
متن کامل